SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Gotschlich, Emil C. 

(ii) TITLE OF INVENTION: GL YCOS YLTRANS FERAS ES FOR BIOSYNTHESIS OF 
OLIGOSACCHARIDES, AND GENES ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Klauber & Jackson 

(B) STREET: 411 Hackensack Avenue 

(C) CITY: Hackensack 

(D) STATE: New Jersey 
<E) COUNTRY: USA 

(F) ZIP: 07601 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: 'PC -DOS /MS -DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/312,387 

(B) FILING DATE: September 26, 1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Jackson Esq., David A. 

(B) REGISTRATION NUMBER: 26,742 

(C) REFERENCE /DOCKET NUMBER: 600-1-09 5B 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201 487-5800 

(B) TELEFAX: 201 343-1684 

(C) TELEX: 133521 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5859 base 'pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrheae 



(B) STRAIN: F62 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..381 

(C) GENE: glys (glycyl tRNA syntetase beta chain) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 445.. 1491 

(C) GENE: lgtA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2342.. 3262 
<C) GENE: lgtC 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3322.. 4335 

(C) GENE: lgtD 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 4354.. 5196 

(C) GENE: lgtE 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTG CAG GCC GTC GCC GTA TTC AAA CAA CTG CCC GAA GCC GCC GCG CTC 
Leu Gin Ala Val Ala Val Phe Lys Gin Leu Pro Glu Ala Ala Ala Leu 
15 10 15 

GCC GCC GCC AAC AAA CGC GTG CAA AAC CTG CTG AAA AAA GCC GAT GCC 
Ala Ala Ala Asn Lys Arg Val Gin Asn Leu Leu Lys Lys Ala Asp Ala 
20 25 30 

GCG TTG GGC GAA GTC AAT GAA AGC CTG CTG CAA CAG GAC GAA GAA AAA 
Ala Leu Gly Glu Val Asn Glu Ser Leu Leu Gin Gin Asp Glu Glu Lys 
35 40 45 

GCC CTG TAC GCT GCC GCG CAA GGT TTG CAG CCG AAA ATT GCC GCC GCC 
Ala Leu Tyr Ala Ala Ala Gin Gly Leu Gin Pro Lys He Ala Ala Ala 
50 55 60 

GTC GCC GAA GGC AAT TTC CGA ACC GCC TTG TCC GAA CTG GCT TCC GTC 
Val Ala Glu Gly Asn Phe Arg Thr Ala Leu Ser Glu Leu Ala Ser Val 
65 70 75 80 

AAG CCG CAG GTT GAT GCC TTC TTC GAC GGC GTG ATG GTG ATG GCG GAA 
Lys Pro Gin Val Asp Ala Phe Phe Asp Gly Val Met Val Met Ala Glu 
85 90 95 

GAT GCC GCC GTA AAA CAA AAC CGC CTG AAC CTG CTG AAC CGC TTG GCA 
Asp Ala Ala Val Lys Gin Asn Arg Leu Asn Leu Leu Asn Arg Leu Ala 
100 105 110 

GAG CAG ATG AAC GCG GTG GCC GAC ATC GCG CTT TTG GGC GAG TAACCGTTGT 
Glu Gin Met Asn Ala Val Ala Asp He Ala Leu Leu Gly Glu 
115 120 125 
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ACAGTCCAAA TGCCGTCTGA AGCCTTCAGG CGGCATCAAA TTATCGGGAG AGTAAA 444 

TTG CAG CCT TTA GTC AGC GTA TTG ATT TGC GCC TAC AAC GTA GAA AAA 492 
Met Gin Pro Leu Val Ser Val Leu He Cys Ala Tyr Asn Val Glu Lys 
1 5 10 ^ 15 

TAT TTT GCC CAA TCA TTA GCC GCC GTC GTG AAT CAG ACT TGG CGC AAC 540 
Tyr Phe Ala Gin Ser Leu Ala Ala Val Val Asn Gin Thr Trp Arg Asn 
20 25 30 

TTG GAT ATT TTG ATT GTC GAT GAC GGC TCG ACA GAC GGC ACA CTT GCC 588 
Leu Asp He Leu He Val Asp Asp Gly Ser Thr Asp Gly Thr Leu Ala 
35 40 45 

ATT GCC AAG GAT TTT CAA AAG CGG GAC AGC CGT ATC AAA ATC CTT GCA 636 
He Ala Lys Asp Phe Gin Lys Arg Asp Ser Arg He Lys He Leu Ala 
50 55 60 

CAA GCT CAA AAT TCC GGC CTG ATT CCC TCT TTA AAC ATC GGG CTG GAC 684 
Gin Ala Gin Asn Ser Gly Leu He Pro Ser Leu Asn lie Gly Leu Asp 
65 70 75 80 

GAA TTG GCA AAG TCG GGG GGG GGG GGG GGG GAA TAT ATT GCG CGC ACC 732 
Glu Leu Ala Lys Ser Gly Gly Gly Gly Gly Glu Tyr He Ala Arg Thr 
85 ~ 90 95 



Q GAT GCC GAC GAT ATT GCC TCC CCC GGC TGG ATT GAG AAA ATC GTG GGC 780 

f*j Asp Ala Asp Asp He Ala Ser Pro Gly Trp He Glu Lys He Val Gly 

3;;; 100 105 no 

SI GAG ATG GAA AAA GAC CGC AGC ATC ATT GCG ATG GGC GCG TGG CTG GAA 828 

fj 'j Glu Met Glu Lys Asp Arg Ser He He Ala Met Gly Ala Trp Leu Glu 
tif 115 120 125 

\| GTT TTG TCG GAA GAA AAG GAC GGC AAC CGG CTG GCG CGG CAC CAC AAA 876 

Val Leu Ser Glu Glu Lys Asp Gly Asn Arg Leu Ala Arg His His Lys 
* 13 0 135 14 0 

f»i| CAC GGC AAA ATT TGG AAA AAG CCG ACC CGG CAC GAA GAC ATC GCC GCC 924 

l!:: His Gly Lys He Trp Lys Lys Pro Thr Arg His Glu Asp He Ala Ala 

U 145 150 155 ~ 160 

f,U 

l5!! ;: TTT TTC CCT TTC GGC AAC CCC ATA CAC AAC AAC ACG ATG ATT ATG CGG 972 

to Phe Phe Pro Phe Gly Asn Pro He His Asn Asn Thr Met He Met Arg 
H» 165 170 175 

CGC AGC GTC ATT GAC GGC GGT TTG CGT TAC GAC ACC GAG CGG GAT TGG 1020 
Arg Ser Val lie Asp Gly Gly Leu Arg Tyr Asp Thr Glu Arg Asp Trp 
180 JL85 190 

GCG GAA GAT TAC CAA TTT TGG TAC GAT GTC AGC AAA TTG GGC AGG CTG 1068 
Ala Glu Asp Tyr Gin Phe Trp Tyr Asp Val Ser Lys Leu Gly Arg Leu 
195 200 205 

GCT TAT TAT CCC GAA GCC TTG GTC AAA TAC CGC CTT CAC GCC AAT CAG 1116 
Ala Tyr Tyr Pro Glu Ala Leu Val Lys Tyr Arg Leu His Ala Asn Gin 
210 215 220 

GTT TCA TCC AAA CAC AGC GTC CGC CAA CAC GAA ATC GCG CAA GGC ATC 1164 
Val Ser Ser Lys His Ser Val Arg Gin His Glu He Ala Gin Gly He 
225 230 235 240 



Ait 



CAA AAA ACC GCC AGA AAC GAT TTT TTG CAG TCT 
Gin Lys Thr Ala Arg Asn Asp Phe Leu Gin Ser 
245 250 

CGG TTC GAC AGC CTA GAA TAC CGC CAA ACA AAA 
Arg Phe Asp Ser Leu Glu Tyr Arg Gin Thr Lys 
260 265 

CTG CCG GAG AAG GAT TTG CCG GAA GAA GAT TTT 
Leu Pro Glu Lys Asp Leu Pro Glu Glu Asp Phe 
275 280 



ATG GGT TTT AAA ACC 
Met Gly Phe Lys Thr 
255 

GCA GCG GCG TAT GAA 
Ala Ala Ala Tyr Glu 
270 

GAA CGC GCC CGC CGG 
Glu Arg Ala Arg Arg 
285 
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1212 

1260 

1308 



TTT TTG TAC CAA TGC TTC AAA CGG ACG GAC ACG 
Phe Leu Tyr Gin Cys Phe Lys Arg Thr Asp Thr 
290 295 



CCG CCC TCC GGC GCG 
Pro Pro Ser Gly Ala 
300 



1356 



TGG CTG GAT TTC GCG GCA GAC GGC AGG ATG AGG 
Trp Leu Asp Phe Ala Ala Asp Gly Arg Met Arg 
305 310 . - " 315 

AGG CAA TAC TTC GGC ATT TTG TAC CGG CTG ATT 
Arg Qln Tyr Phe Gly lie Leu Tyr Arg Leu lie 
325 330 



CGG CTG TTT ACC TTG 
Arg Leu Phe Thr Leu 
320 

AAA AAC CGC CGG CAG 
Lys Asn Arg Arg Gin 
335 



1404 



1452 



n 



m 



n 



GCG CGG TCG GAT TCG GCA GGG AAA GAA CAG GAG 
Ala Arg Ser Asp Ser Ala Gly Lys Glu Gin Glu 
340 345 



ATT TAATGCAAAA 
He 



1498 



CCACGTTATC 


AGCTTGGCTT 


CCGCCGCAGA 


ACGCAGGGCG 


CACATTGCCG 


CAACCTTCGG 


1558 


CAGTCGCGGC 


ATCCCGTTCC 


AGTTTTTCGA 


CGCACTGATG 


CCGTCTGAAA 


GGCTGGAACG 


1618 


GGCAATGGCG 


GAACTCGTCC 


CCGGCTTGTC 


GGCGCACCCC 


TATTTGAGCG 


GAGTGGAAAA 


1678 


AGCCTGCTTT 


ATGAGC CACG 


CCGTATTGTG 


GGAACAGGCA 


TTGGACGAAG 


GCGTACCGTA 


1738 


TATCGCCGTA 


TTTGAAGATG 


ATGTCTTACT 


CGGCGAAGGC 


GCGGAGCAGT 


TCCTTGCCGA 


1798 


AGATACTTGG 


CTGCAAGAAC 


GCTTTGACCC 


CGATTCCGCC 


TTTGTCGTCC 


GCTTGGAAAC 


1858 


GATGTTTATG 


CACGTCCTGA 


CCTCGCCCTC 


CGGCGTGGCG 


GACTACGGCG 


GGCGCGCCTT 


1918 


TCCGCTTTTG 


GAAAGCGAAC 


ACTGCGGGAC 


GGCGGGCTAT 


ATTATTTCCC 


GAAAGGCGAT 


1978 


GCGTTTTTTC 


TTGGACAGGT 


TTGCCGTTTT 


GCCGCCCGAA 


CGCCTGCACC 


CTGTCGATTT 


2038 


GATGATGTTC 


GGCAACCCTG 


ACGACAGGGA 


AGGAATGCCG 


GTTTGCCAGC 


TCAATCCCGC 


2098 


CTTGTGCGCC 


CAAGAGCTGC 


ATTATGCCAA 


GTTTCACGAC 


CAAAACAGCG 


CATTGGGCAG 


2158 


CCTGATCGAA 


CATGACCGCC 


GCCTGAACCG 


CAAACAGCAA 


TGGCGCGATT 


CCCCCGCCAA 


2218 


CACATTCAAA 


CACCGCCTGA 


TCCGCGCCTT 


GACCAAAATC 


GGCAGGGAAA 


GGGAAAAACG 


2278 


CCGGCAAAGG 


CGCGAACAGT 


TAATCGGCAA 


GATTATTGTG 


CCTTTCCAAT 


AAAAGGAGAA 


2338 


AAG ATG GAC 
Met Asp 


ATC GTA TTT GCG GCA GAC GAC AAC 
He Val Phe Ala Ala Asp Asp Asn 


TAT GCC GCC TAC CTT 
Tyr Ala Ala Tyr Leu 


2386 



15 10 

TGC GTT GCG GCA AAA AGC GTG GAA GCG GCC CAT 
Cys Val Ala Ala Lys Ser Val Glu Ala Ala His 



15 

CCC GAT ACG GAA ATC 
Pro Asp Thr Glu He 



2434 



20 



25 



30 



AGG TTC CAC GTC CTC GAT GCC GGC ATC AGT GAG GAA AAC CGG GCG GCG 
Arg Phe His Val Leu Asp Ala Gly He Ser Glu Glu Asn Arg Ala Ala 
35 40 45 

GTT GCC GCC AAT TTG CGG GGG GGG GGT AAT ATC CGC TTT ATA GAC GTA 
Val Ala Ala Asn Leu Arg Gly Gly Gly Asn He Arg Phe He Asp Val 
50 55 60 

AAC CCC GAA GAT TTC GCC GGC TTC CCC TTA AAC ATC AGG CAC ATT TCC 
Asn Pro Glu Asp Phe Ala Gly Phe Pro Leu Asn He Arg His He Ser 
65 70 75 

ATT ACG ACT TAT GCC CGC CTG AAA TTG GGC GAA TAC ATT GCC GAT TGC 
He Thr Thr Tyr Ala Arg Leu Lys Leu Gly Glu Tyr He Ala Asp Cys 
80 85 90 95 

GAC AAA GTC CTG TAT CTG GAT ACG GAC GTA TTG GTC AGG GAC GGC CTG 
Asp Lys Val Leu Tyr Leu Asp Thr Asp Val Leu Val Arg Asp Gly Leu 
100 105 "* no 

AAG CCC TTA TGG GAT ACC GAT TTG ~GGC GGT AAC TGG GTC GGC GCG TGC 
Lys Pro Leu Trp Asp Thr Asp Leu Gly Gly Asn Trp Val Gly Ala Cys 
115 120 125 

ATC GAT TTG TTT GTC GAA AGG CAG GAA GGA TAC AAA CAA AAA ATC GGT 
He Asp Leu Phe Val Glu Arg Gin Glu Gly Tyr Lys Gin Lys He Gly 
130 135 ' 140 

ATG GCG GAC GGA GAA TAT TAT TTC AAT GCC GGC GTA TTG CTG ATC AAC 
Met Ala Asp Gly Glu Tyr Tyr Phe Asn Ala Gly Val Leu Leu He Asn 
145 150 155 

CTG AAA AAG TGG CGG CGG CAC GAT ATT TTC AAA ATG TCC TGC GAA TGG 
Leu Lys Lys Trp Arg Arg His Asp He Phe Lys Met Ser Cys Glu Trp 
160 165 170 ' 175 

GTG GAA CAA TAC AAG GAC GTG ATG CAA TAT CAG GAT CAG GAC ATT TTG 
Val Glu Gin Tyr Lys Asp Val Met Gin Tyr Gin Asp Gin Asp He Leu 
180 185 190 

AAC GGG CTG TTT AAA GGC GGG GTG TGT TAT GCG AAC AGC CGT TTC AAC 
Asn Gly Leu Phe Lys Gly Gly Val Cys Tyr Ala Asn Ser Arg Phe Asn 
195 200 205 

TTT ATG CCG ACC AAT TAT GCC TTT ATG GCG AAC GGG TTT GCG TCC CGC 
Phe Met Pro Thr Asn Tyr Ala Phe Met Ala Asn Gly Phe Ala Ser Arg 
210 215 _ 220 

CAT ACC GAC CCG CTT TAC CTC GAC CGT ACC AAT ACG GCG ATG CCC GTC 
His Thr Asp Pro Leu Tyr Leu Asp Arg Thr Asn Thr Ala Met Pro Val 
225 230 235 

GCC GTC AGC CAT TAT TGC GGC TCG GCA AAG CCG TGG CAC AGG GAC TGC 
Ala Val Ser His Tyr Cys Gly Ser Ala Lys Pro Trp His Arg Asp Cys 
240 245 250 255 

ACC GTT TGG GGT GCG GAA CGT TTC ACA GAG TTG GCC GGC AGC CTG ACG 
Thr Val Trp Gly Ala Glu Arg Phe Thr Glu Leu Ala Gly Ser Leu Thr 
260 265 270 
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ACC GTT CCC GAA GAA TGG CGC GGC AAA CTT GCC GTC CCG CCG ACA AAG 3202 
Thr Val Pro Glu Glu Trp Arg Gly Lys Leu Ala Val Pro Pro Thr Lys 
275 280 285 

TGT ATG CTT CAA AGA TGG CGC AAA AAG CTG TCT GCC AGA TTC TTA CGC 3250 
Cys Met Leu Gin Arg Trp Arg Lys Lys Leu Ser Ala Arg Phe Leu Arg 
290 295 300 

AAG ATT TAT TGACGGGGCA GGCCGTCTGA AGCCTTCAGA CGGCATCGGA 3299 
Lys lie Tyr 
305 

CGTATCGGAA AGGAGAAACG GA TTG CAG CCT TTA GTC AGC GTA TTG ATT TGC 3351 

Met Gin Pro Leu Val Ser Val Leu He Cys 
15 10 

GCC TAC AAC GCA GAA AAA TAT TTT GCC CAA TCA TTG GCC GCC GTA GTG 3399 
Ala Tyr Asn Ala Glu Lys Tyr Phe Ala Gin Ser Leu Ala Ala Val Val 
15 20 25 

GGG CAG ACT TGG CGC AAC TTG GAT ATT TTG ATT GTC GAT GAC GGC TCG 3447 
Gly Gin Thr Trp Arg Asn Leu Asp He Leu He Val Asp Asp Gly Ser 
30 " 35 40 

ACG GAC GGC ACG CCC GCC ATT GCC CGG CAT TTC CAA GAA CAG GAC GGC 3495 
M Thr Asp Gly Thr Pro Ala He Ala Arg His Phe Gin Glu Gin Asp Gly 

45 50 55 



a 
o 



fij 



yj 



AGG ATC AGG ATA ATT TCC AAT CCC CGC AAT TTG GGC TTT ATC GCC TCT 3543 
Arg He Arg He He Ser Asn Pro Arg Asn Leu Gly Phe He Ala Ser 
60 65 70 



TTA AAC ATC GGG CTG GAC GAA TTG GCA AAG TCG GGG GGG GGG GAA TAT 3591 
0] Leu Asn He Gly Leu Asp Glu Leu Ala Lys Ser Gly Gly Gly Glu Tyr 

75 80 85 * 90 



ATT GCG CGC ACC GAT GCC GAC GAT ATT GCC TCC CCC GGC TGG ATT GAG 363 9 

He Ala Arg Thr Asp Ala Asp Asp He Ala Ser Pro Gly Trp He Glu 
95 100 105 



Q AAA ATC GTG GGC GAG ATG GAA AAA GAC CGC AGC ATC ATT GCG ATG GGC 368 7 

Lys He Val Gly Glu Met Glu Lys Asp Arg Ser He He Ala Met Gly 
110 115 120 



GCG TGG TTG GAA GTT TTG TCG GAA GAA AAC AAT AAA AGC GTG CTT GCC 3735 
Ala Trp Leu Glu Val Leu Ser Glu Glu 'Asn Asn Lys Ser Val Leu Ala 
125 130 135 

GCC ATT GCC CGA AAC GGC GCA ATT TGG GAC AAA CCG ACC CGG CAT GAA 3783 
Ala He Ala Arg Asn Gly Ala Ile'Trp Asp Lys Pro Thr Arg His Glu 
140 145 150 

GAC ATT GTC GCC GTT TTC CCT TTC GGC AAC CCC ATA CAC AAC AAC ACG 3831 
Asp He Val Ala Val Phe Pro Phe Gly Asn Pro He His Asn Asn Thr 
155 160 165 170 

ATG ATT ATG AGG CGC AGC GTC ATT GAC GGC GGT TTG CGG TTC GAT CCA 3879 
Met He Met Arg Arg Ser Val He Asp Gly Gly Leu Arg Phe Asp Pro 
175 180 " 185 

GCC TAT ATC CAC GCC GAA GAC TAT AAG TTT TGG TAC GAA GCC GGC AAA 3927 
Ala Tyr He His Ala Glu Asp Tyr Lys Phe Trp Tyr Glu Ala Gly Lys 
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190 195 200 

CTG GGC AGG CTG GCT TAT TAT CCC GAA GCC TTG GTC AAA TAC CGC TTC 3975 
Leu Gly Arg Leu Ala Tyr Tyr Pro Glu Ala Leu Val Lys Tyr Arg Phe 
205 210 215 

CAT CAA GAC CAG ACT TCT TCC AAA TAC AAC CTG CAA CAG CGC AGG ACG 4023 
His Gin Asp Gin Thr Ser Ser Lys Tyr Asn Leu Gin Gin Arg Arg Thr 
220 225 230 

GCG TGG AAA ATC AAA GAA GAA ATC AGG GCG GGG TAT TGG AAG GCG GCA 4071 
Ala Trp Lys lie Lys Glu Glu lie Arg Ala Gly Tyr Trp Lys Ala Ala 
235 240 245 * 250 

GGC ATA GCC GTC GGG GCG GAC TGC CTG AAT TAC GGG CTT TTG AAA TCA 4119 
Gly He Ala Val Gly Ala Asp Cys Leu Asn Tyr Gly Leu Leu Lys Ser 
255 260 265 

ACG GCA TAT GCG TTG TAC GAA AAA GCC TTG TCC GGA CAG GAT ATC GGA 4167 
- - - Thr Ala Tyr Ala Leu Tyr Glu Lys Ala Leu Ser Gly Gin Asp He Gly 

270 275 280 

TGC CTC CGC CTG TTC CTG TAC GAA 'TAT TTC TTG TCG TTG GAA AAG TAT 4215 
Cys Leu Arg Leu Phe Leu Tyr Glu Tyr Phe Leu Ser Leu Glu Lys Tyr 
M 285 290 295 

O TCT TTG ACC GAT TTG CTG GAT TTC TTG ACA GAC CGC GTG ATG AGG AAG 4263 

|p Ser Leu Thr Asp Leu Leu Asp Phe Leu Thr Asp Arg Val Met Arg Lys 

f^x 300 305 310 

\! CTG TTT GCC GCA CCG CAA TAT AGG AAA ATC CTG AAA AAA ATG TTA CGC 4311 

f|] Leu Phe Ala Ala Pro Gin Tyr Arg Lys He Leu Lys Lys Met Leu Arg 

315 320 325 * 330 



m 



S\ CCT TGG AAA TAC CGC AGC TAT TGAAACCGAA CAGGATAAAT C ATG CAA AAC 4362 

Pro Trp Lys Tyr Arg Ser Tyr Met Gin Asn 

pi; 

fij CAC GTT ATC AGC TTG GCT TCC GCC GCA GAG CGC AGG GCG CAC ATT GCC 4410 

w His Val He Ser Leu Ala Ser Ala Ala Glu Arg Arg Ala His lie Ala 

H 5 10 15 

f'\ GAT ACC TTC GGC AGT CGC GGC ATC CCG TTC CAG TTT TTC GAC GCA CTG 4458 

T ! Asp Thr Phe Gly Ser Arg Gly He Pro Phe Gin Phe Phe Asp Ala Leu 

20 25 30 " 35 

ATG CCG TCT GAA AGG CTG GAA CAG GCG ATG GCG GAA CTC GTC CCC GGC 4506 
Met Pro Ser Glu Arg Leu Glu Gin Ala Met Ala Glu Leu Val Pro Gly 
40 45 50 

TTG TCG GCG CAC CCC TAT TTG AGC GGA GTG GAA AAA GCC TGC TTT ATG 4554 
Leu Ser Ala His Pro Tyr Leu Ser Gly Val Glu Lys Ala Cys Phe Met 
55 60 65 

AGC CAC GCC GTA TTG TGG GAA CAG GCG TTG GAT GAA GGT CTG CCG TAT 4602 
Ser His Ala Val Leu Trp Glu Gin Ala Leu Asp Glu Gly Leu Pro Tyr 
70 75 80 

ATC GCC GTA TTT GAG GAC GAC GTT TTA CTC GGC GAA GGC GCG GAG CAG 4650 
He Ala Val Phe Glu Asp Asp Val Leu Leu Gly Glu Gly Ala Glu Gin 
85 90 95 




ru 



fi 
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4698 

4746 

4794 

4842 

4890 

4938 



TTC CTT GCC GAA GAT ACT TGG TTG GAA GAG CGT TTT GAC AAG GAT TCC 
Phe Leu Ala Glu Asp Thr Trp Leu Glu Glu Arg Phe Asp Lys Asp Ser 
100 105 no 115 

GCC TTT ATC GTC CGT TTG GAA ACG ATG TTT GCG AAA GTT ATT GTC AGA 
Ala Phe He Val Arg Leu Glu Thr Met Phe Ala Lys Val He Val Arg 
120 125 130 

CCG GAT AAA GTC CTG AAT TAT GAA AAC CGG TCA TTT CCT TTG CTG GAG 
Pro Asp Lys Val Leu Asn Tyr Glu Asn Arg Ser Phe Pro Leu Leu Glu 
135 140 145 

AGC GAA CAT TGT GGG ACG GCT GGC TAT ATC ATT TCG CGT GAG GCG ATG 
Ser Glu His Cys Gly Thr Ala Gly Tyr He He Ser Arg Glu Ala Met 
150 155 160 

CGG TTT TTC TTG GAC AGG TTT GCC GTT TTG CCG CCA GAG CGG ATT AAA 
Arg Phe Phe Leu Asp Arg Phe Ala Val Leu Pro Pro Glu Arg He Lys 
165 170 175 

GCG GTA GAT TTG ATG ATG TTT ACT TAT TTC TTT GAT AAG GAG GGG ATG 
Ala Val Asp Leu Met Met Phe Thr Tyr Phe Phe Asp Lys Glu Gly Met 
180 185 190 195 

CCT GTT TAT CAG GTT AGT CCC GCC TTA TGT ACC CAA GAA TTG CAT TAT 4986 
Pro Val Tyr Gin Val Ser Pro Ala Leu Cys Thr Gin Glu Leu His Tyr 
200 205 210 

GCC AAG TTT CTC AGT CAA AAC AGT ATG TTG GGT AGC GAT TTG GAA AAA 5034 
Ala Lys Phe Leu Ser Gin Asn Ser Met Leu Gly Ser Asp Leu Glu Lys 
215 220 225 

GAT AGG GAA CAA GGA AGA AGA CAC CGC CGT TCG TTG AAG GTG ATG TTT 5082 
Asp Arg Glu Gin Gly Arg Arg His Arg Arg Ser Leu Lys Val Met Phe 
230 235 240 

GAC TTG AAG CGT GCT TTG GGT AAA TTC GGT AGG GAA AAG AAG AAA AGA 5130 
Asp Leu Lys Arg Ala Leu Gly Lys Phe Gly Arg Glu Lys Lys Lys Arg 
245 250 255 

ATG GAG CGT CAA AGG CAG GCG GAG CTT GAG AAA GTT TAC GGC AGG CGG 5178 
Met Glu Arg Gin Arg Gin Ala Glu Leu Glu Lys Val Tyr Gly Arg Arg 
260 265 270 275 

GTC ATA TTG TTC AAA TAGTTTGTGT AAAATATAGG GGATTAAAAT CAGAAATGGA 5233 
Val He Leu Phe Lys 
280 



CACACTGTCA 


TTCCCGCGCA 


GGCGGGAATQ 


TAGGTCTTTA 


AACTTCGGTT 


TTTTCCGATA 


5293 


AATTCTTGCC 


GGATTAAAAT 


TCCAGATTCC 


CGCTTTCGCG 


GGGATGACGG 


CGGGGGGATT 


5353 


GTTGCTTTTT 


CGGATAAAAT 


CCCGTGTTTT 


TTCATCTGCT 


AGGTAAAATC 


GCCCCAAAGC 


5413 


GTCTGCATCG 


CGGCGATGGC 


GGCGAGTGGG 


GCGGTTTCTG 


TGCGTAAAAT 


CCGTTTTCCG 


5473 


AGTGTAACCG 


CCTGAAAGCC 


GGCTTCAAAT 


GCCTGTTGTT 


CTTCCTGTTC 


TGTCCAGCCG 


5533 


CCTTCGGGCC 


CGACCATAAA 


GACGATTGCG 


CCGGACGGGT 


GGCGGATGTC 


GCCGAGTTTG 


5593 


CAGGCGCGGT 


TGATGCTCAT 


AATCAGCTTG 


GTGTTTTCAG 


ACGGCATTTT 


GTCGAGTGCT 


5653 



I 
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TCACGGTAGC CGATGATGGG CAGTACGGGG GGAACGGTGT TCCTGCCGCT TTGTTCGCAC 5713 
GCGGAGATGA CGATTTCCTG CCAGCGTGCG AGGCGTTTGG CGGCGCGTTC TCCGTCGAGG 5773 
CGGACGATGC AGCGTTCGCT GATGACGGGC TGTATGGCGG TTACGCCGAG TTCGACGCTT 5833 
TTTTGCAGGG TGAAATCCAT GCGATC 58 5 g 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Leu Gin Ala Val Ala Val Phe Lys Gin Leu Pro Glu Ala Ala Ala Leu 
-1 5 10 15 

Ala Ala Ala Asn Lys Arg Val Gin Asn Leu Leu Lys Lys Ala Asp Ala 
20 25 30 

Ala Leu Gly Glu Val Asn Glu Ser Leu Leu Gin Gin Asp Glu Glu Lys 
35 40 45 



Ala Leu Tyr Ala Ala Ala Gin Gly Leu Gin Pro Lys He Ala Ala Ala 

? « 50 55 60 

fll 

m Val Ala Glu Gly Asn Phe Arg Thr Ala Leu Ser Glu Leu Ala Ser Val 

65 70 75 80 



Lys Pro Gin Val Asp Ala Phe Phe Asp Gly Val Met Val Met Ala Glu 
85 90 95 

Asp Ala Ala Val Lys Gin Asn Arg Leu Asn Leu Leu Asn Arg Leu Ala 

100 105 no 

Glu Gin Met Asn Ala Val Ala Asp He Ala Leu Leu Gly Glu 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Gin Pro Leu Val Ser Val Leu He Cys Ala Tyr Asn Val Glu Lys 
1 5 10 15 

Tyr Phe Ala Gin Ser Leu Ala Ala Val Val Asn Gin Thr Trp Arg Asn 
20 25 30 



Leu Asp lie Leu lie Val Asp Asp Gly Ser Thr Asp Gly Thr Leu Ala 
35 40 45 

lie Ala Lys Asp Phe Gin Lys Arg Asp Ser Arg He Lys He Leu Ala 
50 55 60 

Gin Ala Gin Asn Ser Gly Leu He Pro Ser Leu Asn He Gly Leu Asp 
65 70 75 80 

Glu Leu Ala Lys Ser Gly Gly Gly Gly Gly Glu Tyr He Ala Arg Thr 
85 90 95 

Asp Ala Asp Asp He Ala Ser Pro Gly Trp He Glu Lys He Val Gly 
100 105 lio 

Glu Met Glu Lys Asp Arg Ser He He Ala Met Gly Ala Trp Leu Glu 
115 120 125 

Val Leu Ser Glu Glu Lys Asp Gly Asn Arg Leu Ala Arg His His Lys 
130 135 140 

His Gly Lys He Trp Lys Lys Pro Thr Arg His Glu Asp He Ala Ala 
145 150 155 160 

Phe Phe Pro Phe Gly Asn Pro He His Asn Asn Thr Met He Met Arg 
165 170 175 

Arg Ser Val He Asp Gly Gly Leu Arg Tyr Asp Thr Glu Arg Asp Trp 
180 185 190 

Ala Glu Asp Tyr Gin Phe Trp Tyr Asp Val Ser Lys Leu Gly Arg Leu 
195 200 205 

Ala Tyr Tyr Pro Glu Ala Leu Val Lys Tyr Arg Leu His Ala Asn Gin 
210 215 220 

Val Ser Ser Lys His Ser Val Arg Gin His Glu lie Ala Gin Gly He 
225 230 235 240 

Gin Lys Thr Ala Arg Asn Asp Phe Leu Gin Ser Met Gly Phe Lys Thr 
245 250 * 255 

Arg Phe Asp Ser Leu Glu Tyr Arg Gin Thr Lys Ala Ala Ala Tyr Glu 
260 265 270 

Leu Pro Glu Lys Asp Leu Pro Glu Glu' Asp Phe Glu Arg Ala Arg Arg 
275 280 285 

Phe Leu Tyr Gin Cys Phe Lys Arg Thr Asp Thr Pro Pro Ser Gly Ala 
290 295 300 

Trp Leu Asp Phe Ala Ala Asp Gly Arg Met Arg Arg Leu Phe Thr Leu 
305 310 315 320 

Arg Gin Tyr Phe Gly He Leu Tyr Arg Leu He Lys Asn Arg Arg Gin 
325 330 335 



Ala Arg Ser Asp Ser Ala Gly Lys Glu Gin Glu He 
340 345 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Asp lie Val Phe Ala Ala Asp Asp Asn Tyr Ala Ala Tyr Leu Cys 
1 5 10 15 

Val Ala Ala Lys Ser Val Glu Ala Ala His Pro Asp Thr Glu lie Arg 
20 25 30 

Phe His Val Leu Asp Ala Gly He Ser Glu Glu Asn Arg Ala Ala Val 
35 40 45 

Ala Ala Asn Leu Arg Gly Gly Gly Asn He Arg Phe He Asp Val Asn 
.50 55 60 

Pro Glu Asp Phe Ala Gly Phe Pro Leu Asn He Arg His He Ser He 
65 70 75 " 80 

Thr Thr Tyr Ala Arg Leu Lys Leu Gly Glu Tyr He Ala Asp Cys Asp 
85 90 95 

Lys Val Leu Tyr Leu Asp Thr Asp Val Leu Val Arg Asp Gly Leu Lys 
100 105 110 

Pro Leu Trp Asp Thr Asp Leu Gly Gly Asn Trp Val Gly Ala Cys He 
115 120 125 

Asp Leu Phe Val Glu Arg Gin Glu Gly Tyr Lys Gin Lys He Gly Met 
130 135 140 

Ala Asp Gly Glu Tyr Tyr Phe Asn Ala Gly Val Leu Leu He Asn Leu 
145 150 155 160 

Lys Lys Trp Arg Arg His Asp He Phe Lys Met Ser Cys Glu Trp Val 
165 170 175 

Glu Gin Tyr Lys Asp Val Met Gin Tyr Gin Asp Gin Asp He Leu Asn 
180 185* 190 

Gly Leu Phe Lys Gly Gly Val Cys Tyr Ala Asn Ser Arg Phe Asn Phe 
195 200 . 205 

Met Pro Thr Asn Tyr Ala Phe Met Ala Asn Gly Phe Ala Ser Arg His 
210 215 220 

Thr Asp Pro Leu Tyr Leu Asp Arg Thr Asn Thr Ala Met Pro Val Ala 
225 230 235 240 

Val Ser His Tyr Cys Gly Ser Ala Lys Pro Trp His Arg Asp Cys Thr 
245 25'0 *~ ~ 255 

Val Trp Gly Ala Glu Arg Phe Thr Glu Leu Ala Gly Ser Leu Thr Thr 
260 265 270 



Val Pro Glu Glu Trp Arg Gly Lys Leu Ala Val Pro Pro Thr Lys Cys 
275 280 285 

Met Leu Gin Arg Trp Arg Lys Lys Leu Ser Ala Arg Phe Leu Arg Lys 
290 295 300 

lie Tyr 
305 



(2) INFORMATION FOR SEQ ID NO: 5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gin Pro Leu Val Ser Val Leu "lie Cys Ala Tyr Asn Ala Glu Lys 
15 10 15 

Tyr Phe Ala Gin Ser Leu Ala Ala Val Val Gly Gin Thr Trp Arg Asn 
20 25 30 

Leu Asp lie Leu lie Val Asp Asp Gly Ser Thr Asp Gly Thr Pro Ala 
35 40 45 

lie Ala Arg His Phe Gin Glu Gin Asp Gly Arg lie Arg lie lie Ser 
50 55 60 

Asn Pro Arg Asn Leu Gly Phe lie Ala Ser Leu Asn lie Gly Leu Asp 
65 70 75 ' 80 

Glu Leu Ala Lys Ser Gly Gly Gly Glu Tyr lie Ala Arg Thr Asp Ala 
85 90 ~* 95 

Asp Asp lie Ala Ser Pro Gly Trp lie Glu Lys lie Val Gly Glu Met 
100 105 110 

Glu Lys Asp Arg Ser lie lie Ala Met Gly Ala Trp Leu Glu Val Leu 
115 120 * 125 

Ser Glu Glu Asn Asn Lys Ser Val Leu Ala Ala lie Ala Arg Asn Gly 
130 135 140 

Ala lie Trp Asp Lys Pro Thr Arg*His Glu Asp lie Val Ala Val Phe 
145 150 155 160 

Pro Phe Gly Asn Pro lie His Asn Asn Thr Met lie Met Arg Arg Ser 
165 170 175 

Val He Asp Gly Gly Leu Arg Phe Asp Pro Ala Tyr He His Ala Glu 
180 185 190 

Asp Tyr Lys Phe Trp Tyr Glu Ala Gly Lys Leu Gly Arg Leu Ala Tyr 
195 200 205 

Tyr Pro Glu Ala Leu Val Lys Tyr Arg Phe His Gin Asp Gin Thr Ser 



210 215 220 

Ser Lys Tyr Asn Leu Gin Gin Arg Arg Thr Ala Trp Lys lie Lys Glu 
225 230 235 240 

Glu He Arg Ala Gly Tyr Trp Lys Ala Ala Gly He Ala Val Gly Ala 
245 250 255 

Asp Cys Leu Asn Tyr Gly Leu Leu Lys Ser Thr Ala Tyr Ala Leu Tyr 
260 265 270 

Glu Lys Ala Leu Ser Gly Gin Asp He Gly Cys Leu Arg Leu Phe Leu 
275 280 285 

Tyr Glu Tyr Phe Leu Ser Leu Glu Lys Tyr Ser Leu Thr Asp Leu Leu 
290 295 300 

Asp Phe Leu Thr Asp Arg Val Met Arg Lys Leu Phe Ala Ala Pro Gin 
305 310 315 320 

Tyr Arg Lys He Leu Lys Lys Met Leu Arg Pro Trp Lys Tyr Arg Ser 
_ 325 _ 330 335 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Gin Asn His Val He Ser Leu Ala Ser Ala Ala Glu Arg Arg Ala 
1 5 10 15 

His He Ala Asp Thr Phe Gly Ser Arg Gly He Pro Phe Gin Phe Phe 
20 25 30 

Asp Ala Leu Met Pro Ser Glu Arg Leu 'Glu Gin Ala Met Ala Glu Leu 
35 40 45 

Val Pro Gly Leu Ser Ala His Pro Tyr Leu Ser Gly Val Glu Lys Ala 
50 55 60 

Cys Phe Met Ser His Ala Val Leu Trp Glu Gin Ala Leu Asp Glu Gly 
65 70 75 80 

Leu Pro Tyr He Ala Val Phe Glu Asp Asp Val Leu Leu Gly Glu Gly 
85 90 * 95 

Ala Glu Gin Phe Leu Ala Glu Asp Thr Trp Leu Glu Glu Arg Phe Asp 
100 105 * 110 

Lys Asp Ser Ala Phe lie Val Arg Leu Glu Thr Met Phe Ala Lys Val 
115 120 125 



He Val Arg Pro Asp Lys Val Leu Asn Tyr Glu Asn Arg Ser Phe Pro 
130 135 140 

Leu Leu Glu Ser Glu His Cys Gly Thr Ala Gly Tyr He He Ser Arg 
145 150 155 160 

Glu Ala Met Arg Phe Phe Leu Asp Arg Phe Ala Val Leu Pro Pro Glu 
165 170 175 

Arg He Lys Ala Val Asp Leu Met Met Phe Thr Tyr Phe Phe Asp Lys 
180 185 190 

Glu Gly Met Pro Val Tyr Gin Val Ser Pro Ala Leu Cys Thr Gin Glu 
195 200 205 

Leu His Tyr Ala Lys Phe Leu Ser Gin Asn Ser Met Leu Gly Ser Asp 
210 215 220 

Leu Glu Lys Asp Arg Glu Gin Gly Arg Arg His Arg Arg Ser Leu Lys 
225 230 235 240 

Val Met Phe Asp Leu Lys Arg Ala Leu Gly Lys Phe Gly Arg Glu Lys 
245 250 255 

Lys Lys Arg Met Glu Arg Gin Arg Gin Ala Glu Leu Glu Lys Val Tyr 
260 265 270 

Gly Arg Arg Val He Leu Phe Lys 
275 280 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5859 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrheae 

(B) STRAIN: F62 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1491.. 2330 

(C) GENE: lgtB 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CTGCAGGCCG TCGCCGTATT CAAACAACTG CCCGAAGCCG CCGCGCTCGC CGCCGCCAAC 
AAACGCGTGC AAAACCTGCT GAAAAAAGCC GATGCCGCGT TGGGCGAAGT CAATGAAAGC 
CTGCTGCAAC AGGACGAAGA AAAAGCCCTG TACGCTGCCG CGCAAGGTTT GCAGCCGAAA 
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n 

P 
fi 

Vi 

ni 



hi 



ATTGCCGCCG 


CCGTCGCCGA 


AGGCAATTTC 


CGAACCGCCT 


TGTCCGAACT 


GGCTTCCGTC 


240 


AAGCCGCAGG 


TTGATGCCTT 


CTTCGACGGC 


GTGATGGTGA 


TGGCGGAAGA 


TGCCGCCGTA 


300 


AAACAAAACC 


GCCTGAACCT 


GCTGAACCGC 


TTGGCAGAGC 


AGATGAACGC 


GGTGGCCGAC 


360 


ATCGCGCTTT 


TGGGCGAGTA 


ACCGTTGTAC 


AGTCCAAATG 


CCGTCTGAAG 


CCTTCAGGCG 


420 


GCATCAAATT 


ATCGGGAGAG 


TAAATTGCAG 


CCTTTAGTCA 


GCGTATTGAT 


TTGCGCCTAC 


480 


AACGTAGAAA 


AATATTTTGC 


CCAATCATTA 


GCCGCCGTCG 


TGAATCAGAC 


TTGGCGCAAC 


540 


TTGGATATTT 


TGATTGTCGA 


TGACGGCTCG 


ACAGACGGCA 


CACTTGCCAT 


TGCCAAGGAT 


600 


TTTCAAAAGC 


GGGACAGCCG 


TATCAAAATC 


CTTGCACAAG 


CTCAAAATTC 


CGGCCTGATT 


660 


CCCTCTTTAA 


ACATCGGGCT 


GGACGAATTG 


GCAAAGTCGG 


GGGGGGGGGG 


GGGGGAATAT 


720 


ATTGCGCGCA 


CCGATGCCGA 


CGATATTGCC 


TCCCCCGGCT 


GGATTGAGAA 


AATCGTGGGC 


780 


GAGATGGAAA 


AAGACCGCAG 


CATCATTGCG 


ATGGGCGCGT 


GGCTGGAAGT 


TTTGTCGGAA 


840 


GAAAAGGACG 


GCAACCGGCT 


GGCGCGGCAC 


CACAAACACG 


GCAAAATTTG 


GAAAAAGCCG 


900 


ACCCGGCACG 


AAGACATCGC 


CGCCTTTTTC 


CCTTTCGGCA 


ACCCCATACA 


CAACAACACG 


960 


ATGATTATGC 


GGCGCAGCGT 


CATTGACGGC 


GGTTTGCGTT 


ACGACACCGA 


GCGGGATTGG 


1020 


GCGGAAGATT 


ACCAATTTTG 


GTACGATGTC 


AGCAAATTGG 


GCAGGCTGGC 


TTATTATCCC 


1080 


GAAGCCTTGG 


TCAAATACCG 


CCTTCACGCC 


AATCAGGTTT 


CATCCAAACA 


CAGCGTCCGC 


1140 


CAACACGAAA 


TCGCGCAAGG 


CATCCAAAAA 


ACCGCCAGAA 


ACGATTTTTT 


GCAGTCTATG 


1200 


GGTTTTAAAA 


CCCGGTTCGA 


CAGCCTAGAA 


TACCGCCAAA 


CAAAAGCAGC 


GGCGTATGAA 


1260 


CTGCCGGAGA 


AGGATTTGCC 


GGAAGAAGAT 


TTTGAACGCG 


CCCGCCGGTT 


TTTGTACCAA 


1320 


TGCTTCAAAC 


GGACGGACAC 


GCCGCCCTCC 


GGCGCGTGGC 


TGGATTTCGC 


GGCAGACGGC 


1380 


AGGATGAGGC 


GGCTGTTTAC 


CTTGAGGCAA 


TACTTCGGCA 


TTTTGTACCG 


GCTGATTAAA 


1440 


AACCGCCGGC 


AGGCGCGGTC 


GGATTCGGCA 


GGGAAAGAAC 


AGGAGATTTA 


ATG CAA 


1496 



Met Gin 
1 



AAC CAC GTT ATC AGC TTG GCT TCC GCC GCA GAA 
Asn His Val lie Ser Leu Ala Ser Ala Ala Glu 
5 10 



CGC AGG GCG CAC ATT 
Arg Arg Ala His lie 
15 



1544 



GCC GCA ACC TTC GGC AGT CGC GGC ATC CCG TTC 
Ala Ala Thr Phe Gly Ser Arg Gly lie Pro Phe 
20 25 



CAG TTT TTC GAC GCA 
Gin Phe Phe Asp Ala 
30 



1592 



CTG ATG CCG TCT GAA AGG CTG GAA CGG GCA ATG 
Leu Met Pro Ser Glu Arg Leu Glu Arg Ala Met 
35 40 45 



GCG GAA CTC GTC CCC 
Ala Glu Leu Val Pro 
50 



1640 



GGC TTG TCG GCG CAC CCC TAT TTG AGC GGA GTG 
Gly Leu Ser Ala His Pro Tyr Leu Ser Gly Val 
55 60 



GAA AAA GCC TGC TTT 
Glu Lys Ala Cys Phe 
65 



1688 
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ATG AGC CAC GCC GTA TTG TGG GAA CAG GCA TTG GAC GAA GGC GTA CCG 1736 
Met Ser His Ala Val Leu Trp Glu Gin Ala Leu Asp Glu Gly Val Pro 
70 75 80 

TAT ATC GCC GTA TTT GAA GAT GAT GTC TTA CTC GGC GAA GGC GCG GAG 1784 
Tyr He Ala Val Phe Glu Asp Asp Val Leu Leu Gly Glu Gly Ala Glu 
85 90 95 

CAG TTC CTT GCC GAA GAT ACT TGG CTG CAA GAA CGC TTT GAC CCC GAT 1832 
Gin Phe Leu Ala Glu Asp Thr Trp Leu Gin Glu Arg Phe Asp Pro Asp 
100 105 110 

TCC GCC TTT GTC GTC CGC TTG GAA ACG ATG TTT ATG CAC GTC CTG ACC 1880 
Ser Ala Phe Val Val Arg Leu Glu Thr Met Phe Met His Val Leu Thr 
115 120 125 130 

TCG CCC TCC GGC GTG GCG GAC TAC GGC GGG CGC GCC TTT CCG CTT TTG 1928 
Ser Pro Ser Gly Val Ala Asp Tyr Gly Gly Arg Ala Phe Pro Leu Leu 
135 140 145 

GAA AGC GAA CAC TGC GGG ACG GCG GGC TAT ATT ATT TCC CGA AAG GCG 1976 
Glu Ser Glu His Cys Gly Thr Ala Gly Tyr He He Ser Arg Lys Ala 
" " 150 '155 160 

ATG CGT TTT TTC TTG GAC AGG TTT GCC GTT TTG CCG CCC GAA CGC CTG 2024 
Met Arg Phe Phe Leu Asp Arg Phe Ala Val Leu Pro Pro Glu Arg Leu 
^ 165 170 175 



CAC CCT GTC GAT TTG ATG ATG TTC GGC AAC CCT GAC GAC AGG GAA GGA 2072 
His Pro Val Asp Leu Met Met Phe Gly Asn Pro Asp Asp Arg Glu Gly 
Q 180 185 190 



SI 



ATG CCG GTT TGC CAG CTC AAT CCC GCC TTG TGC GCC CAA GAG CTG CAT 2120 
Met Pro Val Cys Gin Leu Asn Pro Ala Leu Cys Ala Gin Glu Leu His 
195 200 205 210 



TAT GCC AAG TTT CAC GAC CAA AAC AGC GCA TTG GGC AGC CTG ATC GAA 2168 
s Tyr Ala Lys Phe His Asp Gin Asn Ser Ala Leu Gly Ser Leu He Glu 

y t 215 220 225 



?Je CAT GAC CGC CGC CTG AAC CGC AAA CAG CAA TGG CGC GAT TCC CCC GCC 2216 

Q His Asp Arg Arg Leu Asn Arg Lys Gin Gin Trp Arg Asp Ser Pro Ala 

M 230 235 240 

Q AAC ACA TTC AAA CAC CGC CTG ATC CGC GCC TTG ACC AAA ATC GGC AGG 2264 

y s Asn Thr Phe Lys His Arg Leu He Arg -Ala Leu Thr Lys He Gly Arg 

245 250 255 

GAA AGG GAA AAA CGC CGG CAA AGG CGC GAA CAG TTA ATC GGC AAG ATT 2312 
Glu Arg Glu Lys Arg Arg Gin Arg "Arg Glu Gin Leu He Gly Lys He 
260 265 270 

ATT GTG CCT TTC CAA TAAAAGGAGA AAAGATGGAC ATCGTATTTG CGGCAGACGA 2367 
He Val Pro Phe Gin 
275 280 

CAACTATGCC GCCTACCTTT GCGTTGCGGC AAAAAGCGTG GAAGCGGCCC ATCCCGATAC 2427 

GGAAATCAGG TTCCACGTCC TCGATGCCGG CATCAGTGAG GAAAACCGGG CGGCGGTTGC 2487 

CGCCAATTTG CGGGGGGGGG GTAATATCCG CTTTATAGAC GTAAACCCCG AAGATTTCGC 2547 



CGGCTTCCCC 


TTAAACATCA 


GGCACATTTC 


CGAATACATT 


GCCGATTGCG 


ACAAAGTCCT 


CGGCCTGAAG 


CCCTTATGGG 


ATACCGATTT 


TTTGTTTGTC 


GAAAGGCAGG 


AAGGATACAA 


TTATTTCAAT 


GCCGGCGTAT 


TGCTGATCAA 


CAAAATGTCC 


TGCGAATGGG 


TGGAACAATA 


CATTTTGAAC 


GGGCTGTTTA 


AAGGCGGGGT 


GCCGACCAAT 


TATGCCTTTA 


TGGCGAACGG 


CCTCGACCGT 


ACCAATACGG 


CGATGCCCGT 


GCCGTGGCAC 


AGGGACTGCA 


CCGTTTGGGG 


CCTGACGACC 


GTTCCCGAAG 


AATGGCGCGG 


GCTTCAAAGA 


TGGCGCAAAA 


AGCTGTCTGC 


CAGGCCGTCT 


GAAGCCTTCA 


GACGGCATCG 



Q CCTTTAGTCA GCGTATTGAT TTGCGCCTAC 

GCCGCCGTAG TGGGGCAGAC TTGGCGCAAC 

f tf i 

%\ ACGGACGGCA CGCCCGCCAT TGCCCGGCAT 

«W ATTTCCAATC CCCGCAATTT GGGCTTTATC 

01 

£j GCAAAGTCGG GGGGGGGGGA ATATATTGCG 

a GGCTGGATTG AGAAAATCGT GGGCGAGATG 

GCGTGGTTGG AAGTTTTGTC GGAAGAAAAC 

s U 

CI AACGGCGCAA TTTGGGACAA ACCGACCCGG 

I . i 

GGCAACCCCA TACACAACAA CACGATGATT 
jU CGGTTCGATC CAGCCTATAT CCACGCCGAA 

CTGGGCAGGC TGGCTTATTA TCCCGAAGCC 
ACTTCTTCCA AATACAACCT GCAACAGCGC 
AGGGCGGGGT ATTGGAAGGC GGCAGGCATA 
CTTTTGAAAT CAACGGCATA TGCGTTGTAC 
TGCCTCCGCC TGTTCCTGTA CGAATATTTC 
TTGCTGGATT TCTTGACAGA CCGCGTGATG 
AAAATCCTGA AAAAAATGTT ACGCCCTTGG 
TAAATCATGC AAAACCACGT TATCAGCTTG 
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CATTACGACT 


TATGCCCGCC 


TGAAATTGGG 


2607 


GTATCTGGAT 


ACGGACGTAT 


TGGTCAGGGA 


2667 


GGGCGGTAAC 


TGGGTCGGCG 


CGTGCATCGA 


2727 


ACAAAAAATC 


GGTATGGCGG 


ACGGAGAATA 


2787 


CCTGAAAAAG 


TGGCGGCGGC 


ACGATATTTT 


2847 


CAAGGACGTG 


ATGCAATATC 


AGGATCAGGA 


2907 


GTGTTATGCG 


AACAGCCGTT 


TCAACTTTAT 


2967 


GTTTGCGTCC 


CGCCATACCG 


ACCCGCTTTA 


3027 


CGCCGTCAGC 


CATTATTGCG 


GCTCGGCAAA 


3087 


TGCGGAACGT 


TTCACAGAGT 


TGGCCGGCAG 


3147 


CAAACTTGCC 


GTCCCGCCGA 


CAAAGTGTAT 


3207 


CAGATTCTTA 


CGCAAGATTT 


ATTGACGGGG 


3267 


GACGTATCGG 


AAAGGAGAAA 


CGGATTGCAG 


3327 


AACGCAGAAA 


AATATTTTGC 


CCAATCATTG 


3387 


TTGGATATTT 


TGATTGTCGA 


TGACGGCTCG 


3447 


TTCCAAGAAC 


AGGACGGCAG 


GATCAGGATA 


3507 


GCCTCTTTAA 


ACATCGGGCT 


GGACGAATTG 


3567 


CGCACCGATG 


CCGACGATAT 


TGCCTCCCCC 


3627 


GAAAAAGACC 


GCAGCATCAT 


TGCGATGGGC 


3687 


AATAAAAGCG 


TGCTTGCCGC 


CATTGCCCGA 


3747 


CATGAAGACA 


TTGTCGCCGT 


TTTCCCTTTC 


3807 


ATGAGGCGCA 


GCGTCATTGA 


CGGCGGTTTG 


3867 


GACTATAAGT 


TTTGGTACGA 


AGCCGGCAAA 


3927 


TTGGTCAAAT 


ACCGCTTCCA 


TCAAGACCAG 


3987 


AGGACGGCGT 


GGAAAATCAA 


AGAAGAAATC 


4047 


GCCGTCGGGG 


CGGACTGCCT 


GAATTACGGG 


4107 


GAAAAAGCCT 


TGTCCGGACA 


GGATATCGGA 


4167 


TTGTCGTTGG 


AAAAGTATTC 


TTTGACCGAT 


4227 


AGGAAGCTGT 


TTGCCGCACC 


GCAATATAGG 


4287 


AAATACCGCA 


GCTATTGAAA 


CCGAACAGGA 


4347 


GCTTCCGCCG 


CAGAGCGCAG 


GGCGCACATT 


4407 



a 

M 



GCCGATACCT TCGGCAGTCG CGGCATCCCG TTCCAGTTTT TCGACGCACT GATGCCGTCT 
GAAAGGCTGG AACAGGCGAT GGCGGAACTC GTCCCCGGCT TGTCGGCGCA CCCCTATTTG 
AGCGGAGTGG AAAAAGCCTG CTTTATGAGC CACGCCGTAT TGTGGGAACA GGCGTTGGAT 
GAAGGTCTGC CGTATATCGC CGTATTTGAG GACGACGTTT TACTCGGCGA AGGCGCGGAG 
CAGTTCCTTG CCGAAGATAC TTGGTTGGAA GAGCGTTTTG ACAAGGATTC CGCCTTTATC 
GTCCGTTTGG AAACGATGTT TGCGAAAGTT ATTGTCAGAC CGGATAAAGT CCTGAATTAT 
GAAAACCGGT CATTTCCTTT GCTGGAGAGC GAACATTGTG GGACGGCTGG CTATATCATT 
TCGCGTGAGG CGATGCGGTT TTTCTTGGAC AGGTTTGCCG TTTTGCCGCC AGAGCGGATT 
AAAGCGGTAG ATTTGATGAT GTTTACTTAT TTCTTTGATA AGGAGGGGAT GCCTGTTTAT 
CAGGTTAGTC CCGCCTTATG TACCCAAGAA TTGCATTATG CCAAGTTTCT CAGTCAAAAC 
AGTATGTTGG GTAGCGATTT GGAAAAAGAT AGGGAACAAG GAAGAAGACA CCGCCGTTCG 
TTGAAGGTGA TGTTTGACTT GAAGCGTGCT TTGGGTAAAT TCGGTAGGGA AAAGAAGAAA 
AGAATGGAGC GTCAAAGGCA GGCGGAGCTT GAGAAAGTTT ACGGCAGGCG GGTCATATTG 
TTCAAATAGT TTGTGTAAAA TATAGGGGAT TAAAATCAGA AATGGACACA CTGTCATTCC 
CGCGCAGGCG GGAATCTAGG TCTTTAAACT TCGGTTTTTT CCGATAAATT CTTGCCGCAT 
TAAAATTCCA GATTCCCGCT TTCGCGGGGA TGACGGCGGG GGGATTGTTG CTTTTTCGGA 
TAAAATCCCG TGTTTTTTCA TCTGCTAGGT AAAATCGCCC CAAAGCGTCT GCATCGCGGC 
GATGGCGGCG AGTGGGGCGG TTTCTGTGCG TAAAATCCGT TTTCCGAGTG TAACCGCCTG 
AAAGCCGGCT TCAAATGCCT GTTGTTCTTC CTGTTCTGTC CAGCCGCCTT CGGGCCCGAC 
CATAAAGACG ATTGCGCCGG ACGGGTGGCG GATGTCGCCG AGTTTGCAGG CGCGGTTGAT 
GCTCATAATC AGCTTGGTGT TTTCAGACGG CATTTTGTCG AGTGCTTCAC GGTAGCCGAT 
GATGGGCAGT ACGGGGGGAA CGGTGTTCCT GCCGCTTTGT TCGCACGCGG AGATGACGAT 
TTCCTGCCAG CGTGCGAGGC GTTTGGCGGC GCGTTCTCCG TCGAGGCGGA CGATGCAGCG 
TTCGCTGATG ACGGGCTGTA TGGCGGTTAC GCCGAGTTCG ACGCTTTTTT GCAGGGTGAA 
ATCCATGCGA TC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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4467 

4527 

4587 

4647 

4707 

4767 

4827 

4887 

4947 

5007 

5067 

5127 

5187 

5247 

5307 

5367 

5427 

5487 

5547 

5607 

5667 

5727 

5787 

5847 
5859 



Met Gin Asn His Val He Ser Leu Ala Ser Ala Ala Glu Arg Arg Ala 
1 5 10 15 

His He Ala Ala Thr Phe Gly Ser Arg Gly He Pro Phe Gin Phe Phe 
20 25 30 

Asp Ala Leu Met Pro Ser Glu Arg Leu Glu Arg Ala Met Ala Glu Leu 
35 40 45 

Val Pro Gly Leu Ser Ala His Pro Tyr Leu Ser Gly Val Glu Lvs Ala 
50 55 60 

Cys Phe Met Ser His Ala Val Leu Trp Glu Gin Ala Leu Asp Glu Glv 
65 70 ' 75 80 

Val Pro Tyr He Ala Val Phe Glu Asp Asp Val Leu Leu Gly Glu Gly 
85 90 9 5 

Ala Glu Gin Phe Leu Ala Glu Asp Thr Trp Leu Gin Glu Arg Phe Asp 
100 105 no 

Pro Asp Ser Ala Phe Val Val Arg Leu Glu Thr Met Phe Met His Val 
115 120 * 125 

Leu Thr Ser Pro Ser Gly Val Ala Asp Tyr Gly Gly Arg Ala Phe Pro 
130 135 140 

Leu Leu Glu Ser Glu His Cys Gly Thr Ala Gly Tyr He He Ser Arg 
145 150 155 160 

Lys Ala Met Arg Phe Phe Leu Asp Arg Phe Ala Val Leu Pro Pro Glu 
165 170 175 

Arg Leu His Pro Val Asp Leu Met Met Phe Gly Asn Pro Asp Asp Arg 
180 185 190 

Glu Gly Met Pro Val Cys Gin Leu Asn Pro Ala Leu Cys Ala Gin Glu 
195 200 205 

Leu His Tyr Ala Lys Phe His Asp Gin Asn Ser Ala Leu Gly Ser Leu 
210 215 220 

He Glu His Asp Arg Arg Leu Asn Arg Lys Gin Gin Trp Arg Asp Ser 
225 2 30 235 *" 240 

Pro Ala Asn Thr Phe Lys His Arg Leu -He Arg Ala Leu Thr Lys He 
245 250 255 

Gly Arg Glu Arg Glu Lys Arg Arg Gin Arg Arg Glu Gin Leu He Gly 
260 '265 270 

Lys He He Val Pro Phe Gin 
275 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PCR primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCCGAGAAAA CTATTGGTGG A 21 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

_ (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PCR primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AAAACATGCA GGAATTGACG AT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Leu Gin Pro Leu Val Ser Val Leu He Cys Ala Tyr Asn Val Glu Lys 
1 5 ^ io 15 

Tyr Phe Ala Gin Ser Leu Ala Ala Val Val Asn Gin Thr Trp Arg Asn 
20 25 30 

Leu Asp He Leu He Val Asp Asp Gly Ser Thr Asp Gly Thr Leu Ala 
35 40 45 

He Ala Lys Asp Phe Gin Lys Arg Asp Ser Arg He Lys He Leu Ala 
50 55 60 

Gin Ala Gin Asn Ser Gly Leu He Pro Ser Leu Asn He Gly Leu Asp 
65 70 75 80 



Glu Leu Ala Lys Ser Gly Gly Gly Gly Gly Glu Tyr He Ala Arg Thr 
85 90 * 95 

Asp Ala Asp Asp He Ala Ser Pro Gly Trp He Glu Lys He Val Gly 
100 105 110 

Glu Met Glu Lys Asp Arg Ser He He Ala Met Gly Ala Trp Leu Glu 
115 120 125 

Val Leu Ser Glu Glu Lys Asp Gly Asn Arg Leu Ala Arg His His Lys 
130 135 "* 140 

His Gly Lys He Trp Lys Lys Pro Thr Arg His Glu Asp He Ala Ala 
145 150 155 * 160 

Phe Phe Pro Phe Gly Asn Pro lie His Asn Asn Thr Met He Met Arg 
165 170 175 

Arg Ser Val He Asp Gly Gly Leu Arg Tyr Asp Thr Glu Arg Asp Trp 
180 185 190 

Ala Glu Asp Tyr Gin Phe Trp Tyr Asp Val Ser Lys Leu Gly Arg Leu 
195 200 ' 205 

Ala Tyr Tyr Pro Glu Ala Leu Val Lys Tyr Arg Leu His Ala Asn Gin 
210 215 220 

Val Ser Ser Lys His Ser Val Arg Gin His Glu He Ala Gin Gly lie 
225 230 235 240 

Gin Lys Thr Ala Arg Asn Asp Phe Leu Gin Ser Met Gly Phe Lys Thr 
245 250 255 

Arg Phe Asp Ser Leu Glu Tyr Arg Gin Thr Lys Ala Ala Ala Tyr Glu 
260 265 270 

Leu Pro Glu Lys Asp Leu Pro Glu Glu Asp Phe Glu Arg Ala Arg Arg 
275 280 285 

Phe Leu Tyr Gin Cys Phe Lys Arg Thr Asp Thr Pro Pro Ser Gly Ala 
290 295 300 

Trp Leu Asp Phe Ala Ala Asp Gly Arg Met Arg Arg Leu Phe Thr Leu 
305 310 315 320 

Arg Gin Tyr Phe Gly He Leu Tyr Arg Leu He Lys Asn Arg Arg Gin 
325 330 335 

Ala Arg Ser Asp Ser Ala Gly Lys Glu Gin Glu He 
340 * "345 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Leu Gin Pro Leu Val 
1 5 



Ser Val Leu lie Cys Ala Tyr Asn Ala Glu Lys 
10 15 



Tyr Phe Ala Gin Ser 
20 



Leu Ala Ala Val Val Gly Gin Thr Trp Arg Asn 
25 30 



Leu Asp lie Leu lie 
35 



Val Asp Asp Gly Ser Thr Asp Gly Thr Pro Ala 
40 45 



lie Ala Arg His Phe 
50 



Gin Glu Gin Asp Gly Arg lie Arg lie lie Ser 
55 60 



Asn Pro Arg Asn Leu 
65 



Gly Phe lie Ala Ser Leu Asn He Gly Leu Asp 
70 75 80 



Glu Leu Ala Lys Ser 
85 



Gly Gly Gly Glu Tyr He Ala Arg Thr Asp Ala 
90 95 



Asp Asp He Ala Ser 
100 



Pro Gly Trp He Glu Lys He Val Gly Glu Met 
105 110 



Glu Lys Asp Arg Ser 

lis 



He He Ala Met Gly Ala Trp Leu Glu Val Leu 
120 * 125 



Ser Glu Glu Asn Asn 
130 



Lys Ser Val Leu Ala Ala He Ala Arg Asn Gly 
135 140 



Ala He Trp Asp Lys 

145 



Pro Thr Arg His Glu Asp He Val Ala Val Phe 
150 155 160 



Pro Phe Gly Asn Pro 
165 



He His Asn Asn Thr Met He Met Arg Arg Ser 
170 175 



Val He Asp Gly Gly 
180 



Leu Arg Phe Asp Pro Ala Tyr He His Ala Glu 
185 190 



Asp Tyr Lys Phe Trp 
195 



Tyr Glu Ala Gly Lys Leu Gly Arg Leu Ala Tyr 
200 " 205 



Tyr Pro Glu Ala Leu 
210 



Val Lys Tyr Arg Phe His Gin Asp Gin Thr Ser 
215 ~ 220 



Ser Lys Tyr Asn Leu 
225 



Gin Gin Arg Arg Thr Ala Trp Lys He Lys Glu 
230 235 240 



Glu He Arg Ala Gly 
245 



Tyr Trp Lys Ala -Ala Gly He Ala Val Gly Ala 
250 255 



Asp Cys Leu Asn Tyr 
260 



Gly Leu Leu Lys Ser Thr Ala Tyr Ala Leu Tyr 
"265 270 



Glu Lys Ala Leu Ser 
275 



Gly Gin Asp He Gly Cys Leu Arg Leu Phe Leu 
280 285 



Tyr Glu Tyr Phe Leu 
290 



Ser Leu Glu Lys Tyr Ser Leu Thr Asp Leu Leu 
295 300 



Asp Phe Leu Thr Asp 
305 



Arg Val Met Arg Lys Leu Phe Ala Ala Pro Gin 
310 * 315 320 



Tyr Arg Lys He Leu 
325 



Lys Lys Met Leu Arg Pro Trp Lys Tyr Arg Ser 
330 335 
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